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METHOD AND DEVICES FOR PRIORITIZING BliBCTRONIC MESSAGES 



FIELD OF THE INVENTION 

The present invention relates to a system and a meliiod for reducing unproductive 
electronic messages traf&c. More specifically, the invention relates to a system and 
a method to imptovc the efficiency of electronic messages tisage. 

5 BACKGROUND OF THE INVENTION 

Electronic mail (e-mail) is an indispensable tool for both internal and 
external communication in most organizations. Unfortunately , the unrestricted use 
of electronic mail, without rules or limitations, has caused e-mail traffic load to 
explode to the point v^diere it is becoming a real threat to the infirastructure and to 

10 the workforce productivity of many organizations. 

Many employees receive literally tens or even hundreds of e-mails daily, a 
great deal of which are imimportant or irrelevant, forcing them to spend a major 
part of their workday reviewing e-mails, and leaving them short and shorter of time 
for other, more important tasks. Moreover, too often important messages get buried 

15 in the e-mail barrage, and as a result, the reader fails to respond or act upon them. 

Many techniques have been proposed for filtering and categorizing e-mail 
messages. Some of thCTi are simple rule based methods that take simple actions 
such as automatic response, relying on parameters such as the sender's identity and 
a basic key word search (as described, for example, in US Patent No. 5,555,346). 

20 Others recommend more complex automatic actions, such as scheduling an 
appointment (as described in US Patent No. 6,553,358) or redirecting the message 
to an alternative communication device (e.g. US Patent No. 6,499,021). Many prior 
art systems perform an analysis of the message text, using machine-learning 
algorithms, and assign it to one of a predetermined set of categories. Some prior art 
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tedmiques take assistance ftom additional data such as the sendor^s identity being a 
member of the ^jcruly ot a woik associate etc 

A drawback of knowa techniques is liieit: poor ability to predict the 
koportaace of tecdved e-inmls to the redpient and what is lixe right order to 

5 process the massive amount of messa^ in the user's inbox. Moreover, for 
employees in medium and large oxgamzations^ Ibe vast msyorify of incoming 
messages oxiginate fix>m within the organization, and the importance of fliose is 
very difficult to det^mine based solely on tbsk cont^. For example, a device 
dilute in a telecommunications service provider's installation discovered by a field 

10 engineer, is reported by e^-mail to a l<mg distribution list coniprising of developm^ 
p^onnel, customer support personnel, regional sales people 2End manag^ent. The 
origtoal e-mail message triggers a long chain of messages that may include 
questtons, aiaswers, comments and personal opimons relating to the origmal 
message. These messages reach a broad distribution list usiag the femous '*reply 

15 all" feature. Many recipients gradually lose interest in the discussirau but they keep 
being copied neverthdess. The e-mails in the chain are vety hard to distinguish by 
analyzing their content, since they all mclude ihe origmal problem description, and 
they all ixiclude relevant technical ^rms. On the other hand, their subjective 
importance and relevance for different recipients may vary significantly. 

20 US 2002/0071546A1 O^tennan) published June 13, 2002 and entitled 

^Meffuxi de^he and software for processing incoming communications** discloses 
methods, devices and software for processmg incoming communicatiaDS such as e- 
mails wheretyy incoming messages and calls xnay be prioritized in accordance with 
file rank of the message or call odgpbiator within the organizatiDn. This m^ be 

25 effected by queiying an cngacization chart for tibie CHganizaticni upon receipt of an 
incomiog .ccmminnication, in order to assess the rank of the origbaator. The 
c»sanization chart may be stored in a directory servei; and queried by a computing 
device receiving the message or processing the caD * 

tot all embodiments, higher priority is given when the originator is of higher 

3 0 rank than the recipient. In one variation, the detTOnining metric is the total distance 
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within the otgajiizadonal hierarcliy betweCTi a supervisor who is common to tlie 
Qsigmator and xecipieoL This variation also accords higher priority to a message 
whose Qri^nator is sup^or in rank than the recipient 

WC)0180535A1 published October 25, 2001 and entitled ''(ikmrnvmcations 

5 Prioritizet^ discloses a method of prioritizing a received inforroaiion message in 
which the circumstantial origin of the message is incficated by a petsonalxsed 
idenfiUSw flccompansying or d^ved firom the message in i^ard to e-mail or ofli^ar 
communications systems- The meOiod includes the elements of receiving the 
message detemuning the personalized identifier, looklng-i:^ and ooss^re&rencing 

10 the personalized identifia: to a database of known personalized id^tbSer and 
priority codeSj, assigning a priority code to the message per the result of the element 
of looking-iq) and cross-referendng, and prioritizing (including categorizing, 
sorting, redirecting, erasing, or otherwise acting tqpon) the received message 
according to toe priority code. 

15 This publication too is based on the rank of the SOTiding party* and dow not 

take into account the rank of the recipient 

SUMMARY OF THE INVENTION 

It is an objectiye of the invention to provide a method and system that allow 

incoming messages to be pre^sorted or pre-tagged based on their relative 
20 hnportance as defined by pre-defined criteria, so as to allow the recipient to attend 

fiist to ttiose received messages that are mcely to be most urg^. 

The present invention addresses this objective by providing a method and 

system for assigning importance classes to electronic messages. The teem 

^electronic messages" relates to C'-mail mess^ges^ fitcsimile mi^sages^ or to text 
25 data of converted voice messages or pager messi^es. In Ihe contsTct of the ^n^ent 

invention, the term "importance class" relates to the degree of relevance to a certain 
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recipient of a coiiiaiumcations assigned by a system usmg the metlLod to each of a 
group, consisting at least one element, of electronic messages. The tenn "assigning 
unportance classes" relates to associating eadi of a groiq), consisting at least one 
element, of electronic messages, with an importance class attribute, for example^ by 
5 means of embedding, tagging or any other acceptable linking method. 

The m^od comprises identiQong the sender of an elechtmic mess^ge^ 
idmtil^fing the recipient of the electronic m^sage, detemuning a relative 
oigatdzational distance between the sender and the recipient, and assigning the 
electronic message an inQ|»ortance class as a function that assigns the inq)aitance 

10 class T^stdle^s of vfbslliveat tiie sender and the recipient is of higjb^ rank. 

In the context of the present invention, &e femi '^relative otganizational 
distance" relates to a metric derived fiom an organization hierarchical structure. 
Specifically, in a preferred embodiment the relative organizational distance is a 
function of the level of work afSliation between flie craresponding departmmts of 

15 the message sender and the message redpient, and of the relative Merarchical level 
of said sender and receiver. In a further embodiment, this fhnction is refined 
according to one or more of the following: (a) a set of global control rules 
accordfaig to die organizational stractuie and the woik a£Sliation among different 
departments and different hi^mcbical layeis in the coxporatioin; (b) a set of control 

20 rules according to ad hoc work groins finmed fi:om time to time; (c) a global list of 
preferred originating addresses, eadmial to the oigani^cxD, 6xm sendi^ affiliated 
with the oi^anization. 

For example, a message fitnxi an individiial in the same dq)artm^ as the 
recipient is often attributed a higher uuportance than a message atriviiig from a 

23 dififerent department, or a message from the same position level or firom a direct 
supervisor or from a person directly reporting to the recipient, is athibutcd a hij^er 
importance class than a message from a sender positioned much higher or further 
down in the Im^hy. A message from an essential unit fox daily operation (e.g. a 
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message from a tesearch unit to a developmeat xmit) is attributed a higher 
iir5x>rtance than a message from non-essential unit (e.g. legal or administrative). 
The organizational rules naiay also be for a specific period of time, for exanorple, a 
project involving two development teams A and B collaborating for a specific 

5 period of time conveys higher importance on communication between the two. 

In a ftuHier embodiment, the computation of the importance class attribute is 
a wei^ted average of the relative organizational distance, and at least one 
additional criteriorifselected from the following: (a) a globally defined content 
" criterion; (b) a personally defined message sender criterion; (c) a personally. 

10 defined content criterion; (d) a pliirahty of rules formed by a machine-learning 
algorithm ox algorithms; (e) an analysis of e-mail message headers. 

The term "globally defined content criterion" relates to a pre-defined set of 
key words, terms and phrases, constituting references of "important" and 
"ummpdrtant" content items found in the text of the message body and^or the 

15 message subject field. As the word "globally" indicates, this references set applies 
for all users* messages. Similarly, the term "personally defined content criterion" 
relates to such references set of "important" Ind "undmportant" content items 
defined individually by a user, which applies only for messages arriving to that 
user's inbox. 

20 The term "machine-learning algorithms"* relates generally to cornputational 

models and techniques for automatic improvCTient of performance, based on past 
expCTience. In the context of the preset invention, such algorithms are used for 
tracing users' actions ^xpon receipt of a message; for example, opening, replying, 
forwardiQg or deleting. Crossing said actions with other data related to the 

25 respective message, for example, sender's identity, content items, etc. and 
comparing it with past information gathered, allows derivation of new classification 
rules accordingly, based on the assumption that a user's behavior consistently 
observed through time indicates of the importance ascribed by the recipient to 
received messages. 
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The tenn "analysis of e-mail message headers" relates to detection, 
interpretation and processing of a feature or features in message headers, 
specificafly, message headers such as V, "cc" and **bcc" fields. An exai^ple for 
such a feature or features is the number of the recipients in the 'Ho*', "cc" or **bcc" 
5 field. A user qipearance as flie sole- recipient- of a message may imply higher 
importance or relevance of said message to ttie recipient. 

The present invention fiirfher. relates to a method for streamlining flie 
management of electronic messages, the mefliod comprising: (a) assigning an 
10 importance class to each of said messages; and (b) streamlining said messages in a 
pre-deteimined manner in accordance with the respective importance class of each 
message. 

In a preferred embodiment, the streamlining includes displaying 
nblificktions of incoming m6ssages either in a color thaft is characteristic of flie 

15 respective icnportance class of each message, or with a distinctive tag that is 
characteristic of fhe^respectiveinq^ a pre- 

determined order, for example, in descendmg ordea:;^indicating tiieir relative 
importance in respect with their assigned inq)ortance classes. In a further 
embodiment, the streamlining includes blocking messages whose importance class 

20 is beneath a predetermined threshold, either with or witiiout alerting the sender that 
a message has been blocked. 

It is to be appreciated that the invention uses the importance level parameter 
to enable multiple, dififerent operations, for example, the user is able to differentiate 
the most important messages and focus his/her attention to those messages 

25 exclusively; Further examples are the transmission of the most important messages 
only to the e-mail client (referred to herewith as "selective synchronization"), or the 
archiving of less important ones (referred to herewith as "selective archiving"), thus 
the overall e-mail processing efl5ciency is increased. 

In the scope of the present invention is also a system for assigning 
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ixoppxtance class©? to electronic messages, said system comprising: 

(a) a message data extraction unit for identifying a sender and a xecipiqat of 
an electronic in^sage. The message data extraction unit is a series of comcputor 
instructions ad^qpted to capture an identification of a message sender and the 

5 message redpienQ and 

(b) a dassifi^ coupled to the message data extraction unit and being 
responsive to ardative orgftuiyational distance between the sender and the recipient 
ito flggijEning to the electronic message an importance class in inverse dependence 
on the relative organizational distance between Hie sender and the red^^ The 

10 classifier Is a module of compute program enable of associating identities of 
message send^ and message recipients with pre-det^nooined organizational data» 
{<Xi detexraining the relative organizational distance between respective solders and 
recipients. The classifier is further capable of calculating an importance class 
attribute of a m^sage^ according to a relative otganizationai distance between the 

IS sender and the recipient 

In a fiirther embodiment, the classifier is fiirther adapted to assigning said 
importance class based on at least one additional mterion^ selected &om the 
following: 

(a) a pre^definedxxiessagesend^ criterion; 
20 (b) a pre-defined content critedon; 

(c) a plurality of rules fbnaaed by a n:iachin&-Ieanung algorithm tcadng 
Tiser actions; 

(d) an analysis of e-mail message headers. 

According to a preferred embodhnent^ the calculation of tibie in]portance 
25 class attribute by the classifier furftier mvolves a rules formatiQn unit comprising; 

(a) a set xDf ^obal control rules rdating to an organizational structure and 
wodc affiliation among diSbr ent departm^ and dififer^t hierarchical layers thereof, 

(b) a set of control rules relating to ad hoc work groups formed firom 
tune to lime in said organizationsi structure; and 
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(c) a global list of preferred originating addresses external to the 
oiganizational structure. 

The set of rules in the rules formation mut may be either pre-detecmined (i-e. 
an iiDmutable data object s\q)pli6d by the system provider) or a dynamic product of 
5 the system (i.e. an adaptive, configurable, rule generating modules-responsive to 
user behavior, user .configuration, administratof configuration, message content and 
features analysis, or other system variables). 

In a further embodiment the rules schema encapsulated in the rules 
formation unit includes three categories, as will be explained in detail hereinafter: 
10 (a) organizational rules, (b) content dependent rules, and (c) user-behavior based 
rules. 

Organizational rules make use of the infira-structiire of the organization to 
determine the importance of the message, as will be further explained and 
illustrated in detail hereinafter. 
15 : In one embodiment, the system stores the departmental and hierarchical 
stmcture of the organization in a database, along with a definition of the work 
affiliation level between the various departments. The organizational structure 
•information can be input to the system's database using available industry tools, or 
through a Gr^hical User Interface (GUI) specific to a sj^tem according to the 
20 invention. - ^ 

Alternatively, the system may use a "skeleton" hierarchical structure which 
is based on permanent relationship between units. 

The system enables to define ad hoc groups of users who, for a specific 
period of time, may require frequent and high priority communicationjlevel 
25 regardless of their regular position in the organization. ^ 

In addition to the organizational data, referred to as "global organizational 
data", which is set by the organization, each user is allowed to defiae individual 
preferences, referred to as "personal organizational data". A person, although not 
bound by the organization, may increase, for his/her own reasons, the importance of 
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messages received jSrom a specific unit 

The content rules in the second rules category determine the importance 
- . level attribute according to the degree of correlation between the message's content 

and reference sets of '*itnportanf ' and '"unirnportanf ' content items. 
5 " - Most reference- contCTit items are global, although the system allows each 

user a limited ability to define personal reference content items, as- long as they do 

not conflict with the global definition. 

The third rules category employs a machine-learning algorithm that updates 

and refines the said predefined organiTational and content rules according to the 
10 actions taken by each user on previously received messages. The user's- actions are 

recorded by a "user-behavior agent". Those actions may be interpreted as indicative 

of the subjective importance assigned to the message. 

Each rule categoiys contribution to the classification process may vary, 

depending on weighting factors and on the confidence of each category's decision, 
15 which can be either pre-determined or dynamically assigned, for example, the 

weighting coefiBcient of the content category could be increased or decreased 

automatically according to the corresponding confidence in assigning the 

importance.:class.?Ih accordance with a preferred embodiment, the organizational 

'category (both global and personal) has the highest contribution, ie. in the range of 
20 60% to 70% of the total contribution suiru The behavior and content categories 

contribute the rest. 

.BRIEF DESCRIPTION OF THE DRAWINGS 

^ In order to understand the invention and to see how it may be carried out in 
.practice, some preferred embodiments will now be described, by way of non- 
25 .;limiting example only, with reference to the accompanying drawings, in which: 

Fig. 1 is a diagram illustrating one embodiment of a system for classifying 
incoming messages; 

Fig. 2 is a diagram illustrating in detail a classification system as featured in 
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Fig. 1; 

Big. 3 is a diagram illustrating the integration of a system as depicted in 
detail in. Fig. 1 and 2 in an overall environment adapted to the streaxnliaing of 
electronic communication n[ianagemen^ 
5 Fig. 4 iUnstrates n^detail the &-niail sender interface dqpict^ 

Fig. 5 illustrates in detail the configuration management and monitoring 
module depicted in Fig. 3; 

Fig. 6 illustrates in detail the e-mail client interfece depicted in Fig. 3 ; 

Fig. 7 illustrates in detail the user functions module depicted in Fig. 3; 
10 Fig. 8 is a diagram illustrating the integration of the environment depicted in 

Fig. 3 in an enterprise network; and ^ 

Fig. 9 depicts an Gxmnplc of the flow of execution from a message arrival to 
a user's inbox imtil the notification thereof is presented to the user with flie 
appropriate importance class indication. - 

15 

DETAILED DESCRIPTION OB^THE INVENTION 

Fig. 1 illustrates a system (400) according to the invention, also referred to 
herewith as '^classification engine", for classifying incoming messages. The 
classification engine (400) includes an extract message data unit (470) for gleaning 

20 the information required to establishment of the message's importance class, by 
meanis of applying, within the classifier (450), a set of classification rules fomid in 
the rules formation unit (480). The rules formation unit (480) is able to maintain a 
repository of pre-determmed classification rules, as well as generating new ones, 
- based upon flie information obtained fcom the extract message data unit (470), and 

25 from the organizational structure data, both being transferred to rules formation unit 
(480), as shown in the drawing. Once the message's importance class is established 
by the classifier (450), it is then assigned to the message by the classifier (450). 
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Fig. 2 describes in detail the classification engine (400) of Fig. 1. The 
classification engine (400) includes a rules generator (410) which receives 
organizational data from the oig chart application (30), possibly througih drivers 
(80) or other intermediating software, and configuration parameters &Gm the 
5 S3^tem. administrator and from the users. The user behavior agent (420) adaptively 
derives rules-based on individual message handling. 

The rules are stored in the rules database (430). The classifier (450) uses the 
rules firom the database, along with the message features siqpplied by a feature 
extractor (440) and contrat items supplied by a text parser and analyzer (460), to 
10 determine the importance of each message. The rules generator (410), the user 
behavior agent (420) and the rules database (430) together constitute the rules 
formation unit (480) shown ia Fig. 1. The feature extractor (440) together by the 
text parser and analyzer (460) constitute the extmct message data unit (470) shown 
iuFig. 1. 

15 

Fig. 3 illustrates the integration of a classification engine (400) as depicted 
in detail in Fig. 1 and 2 in an overall envir^imient (10), also referred to herewith as 
a "system", adapted to the streamlining of electronic communication management 
In the system ( 10) the classification engine (400) is coupled to a user fimctions 

20 module (500) which mcapsulates the implementation of user related fimctions, 
such as the selective synchronization and the selective archiving fimctions. The 
classification engine (400) and the user fimctions module (500) are both coupled to 
an e-mail server interface (200) which serves for the transfer of e-naail messages as 
well as control and status messages between the e-mail server and the components 

25 in the environment (10) interacting with it^The user fiincticns module (500) is also 
coupled with a configuration, management and monitoriag module (100) which 
enables the system administrator to manage and configure the operation of the 
system (10) as well as to monitor its status and perfoimance. The configuration, 
management and monitoring module (100) also allows for each individual user to 

10 
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modify the classification rules encapsulated in the classification engine (400), as 
well as selected modes of operation, for example, with the system (10) enabled or 
disabled, according to one's personal preferences. In further embodiments the 
configuration, management and monitoring module ( 100) conoprises an interface 
5 for accessing configuration, management or rnonitoring functions over standard 
TCP/IP communication transport chaimels, such as HTTP (Hyp^-Text Transfer 
Protocol), using a standard web browser application (70). 

The Org Chart ^plication (30) encapsulates the organization chart, which is 
extracted from it Ihrongh the drivers (80). In further embodiments the Org Chart 
10 application (30) includes tools for building a database describing the orgariizational 
structure, and for updating said database's records according to permanent or 
temporary changes in this structure. 

It is to be appreciated that direct interaction between the cUent coniputer and 
Ihe system.components is reduced to a miniminn necessary. 

15 

Fig. 4 depicts in detail the e-mail server interface (200) shown in Fig. 3 . The 
functionality of the e-mail server interface (200) Gomprises-rjaonitoring message 
traffic inside the e-mail server, interception of incoming and outgoing messages, 
transferring those messages or copies thereof to -other modules featured in the 

20 enviroimient (10) of Fig. 3, such as liie classification engine (400) and the user 
functions (500), and sending the messages back to the e-mail server after or during 
their processing procedure by the different units of system (10). 

The traffic monitoring unit (220) detects messages arriving to users* inboxes 
hosted on the e-mail server, and activates the Message Intercept and Return module 

25 (210), which posts them in the message buffer (23Q) for further processing by the 
classification engine (400). The message buffer (23X)) dequeues importance-lagged 
messages after being processed by the classification engine (400), and places them 
in the recipient's mailbox with the proper importance attribute. 
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Fig. 5 illustrates in detail the configuration, management and monitoring 
module (100) depicted in Fig. 3. The configuration, management and monitoring 
module (100) allows for system adnmnstrators to configure a plurality of 
parameters of the system (10) in Fig- 3 for optimal perfomiance, and to adapt said 

5 system for specific needs of a certain organization. In addition the module (100) 
enables the administrator to monitor the performance of the system (10) and to 
perform fault tracking. E-mail users can also utilize the module (100) for personal 
preferences setting. The module (lOft) includes a configuration database (140) for 
maintaining the administrator's and the users' settings, and a performance database 

10 (130), for maintaining the performance data, such as messages distribution, 
xnalfimctions and exceptions, usage monitoring and other reports, as well as a list of • 
logging parameters. The module (100) further sxipphes graphical tools for said 
configuration, management and monitoring purposes, in the means -of an 
administrator GUI (110) -and a user GUI (120), as will be explained and illustrated 

1 5 in detail hereinafter 

Fig. 6 describes in detail the e-mail client interface (300) depicted mJFig. 3. 
The e-mail client interface (300) includes ad agent controller (320) that controls a? 
behavior agent (40) sitting in the e-mail client (50) which captures the user actions 
20 on received messages. The behavior agent (40) can be remotely installable and 
executable. The agent controller (320) receives user action information in addition 
to a designator to tiie corresponding message, and stores them in the user's actions 
database (3 10) for use by the classification engine (400). 

25 Fig. 7 describes in detail the user fimctions module (500) depicted in Fig. 3. 

The user functions module (500) includes a selective synchronization module (520) 
which allows e-mail clients to limit their inbox exclusively to e-mails with a 
predefined importance level, and a selective archiving module (510) which allows 
archiving messages residing in the user's inbox, whose importance class is beneath 
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a predetermined threshold. The selective archiving module (510) also enables 
convenient, easy search and retrieval of previously archived messages. 

Specifically, the selective synchronization module (520) allows a selective 
transmission of e-mail messages from the e-mail server*s inbox to the client 
5 coinputer's inbox, according to a predefined iiiq)ortance level. - - 

The selective mailbox synchronization procedure in a system according to a 
preferred embodiment of the present invention is as follows: 

(a) The traflBc monitoring unit (220) in^ the e-mail server interfece 
module (200) receives a '"request for synch'* event from the e-mail server 

(b) The s elective s ynchronization m odule ( 520) c hecks t he i mportance 
tag of each message before its transfer to the client computer, and approves or 
prevents the transfer according to the corresponding importance tag. 

- Each e-mail user sets the following parameter that affects Ihe fimctionality 
15 of the selective synchronization module (520): 

Synchronization^importance (messages of this importance level and above 
is transferred to the chent) 

The selective archiving module (510) allows "smart archiving" of previously 
20 received messages, by means of packing groups of e-mails residing in a user's 
inbox into archives according to their importance class, and the time elapsed since 
they were received. 

The Smart Archiving ftinction for each user is activated based on the^ 
preferences selected by the user: J 
25 Enable/disable archiving - ? 

Max_number_of jtnails in inbox (start archiving after niimber is exceeded) - 
MaxJbabox_memory_size (start archiving after size is exceeded) 
Archive^importance (archive messages at this importance level or lower) 
Archive_agejresolution (archive messages older than this value) 
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Destination folder names 

Archived Messages are stored in cx)nq)ressed format in order to save storage 

space. 

Fig. 8 illustrates the integration of the system (10) in an enterprise netwod^ 
5 according to one embodiment of the present invention. The enterprise network is a 
model of a working environment in which different computer programs and ^ 
applications are running on a distant machine external to a user's computer and are 
accessed by the user through a web browser. The main server (710) runs the major * 
components of the system ( 10), such as the classification engine (400), the user 
10 functions m odule ( 500), and o flier r elated functions. The e -mail s erver ( 730), a s 
well as the management console (720) and the users' conq)uters (740H760) are all 
connected to the main server (710) through the enterprise network as shown in the 
drawing, thus enabling access to the configuration, management and monitoring 
module (100) of the system (10), usihg the graphical tools supplied by the module 
15 (100), such as the administrator GUI (110) on the management console (720), and 
the usCT GDI (120) on.ihe users* computers (740)-(760). 

Fig. 9 depicts an example of the flow of execution ficom arrival of a message 
to a user's inbox until the notification thereof is presented to the user with the 

20 appropriate importance class indication. In step 800, the traffic monitoring unit 
(220) in the e-mail server interface (200) gets an event on a new message being 
stored in a users* inbox, i.e. an incoming message. In step 810, the trafBo 
monitoring unit (2^0) determines whether the respective message related with said 
event is, indeed,-tiew. If the message is new, in step 820 the message intercept and 

25 return unit (210) then copies the message and stores it in the message buffer (230), * 
firom which the classification engine (400) dequeues the message in step 830, sets 
its importance tag in step 840 and restores it ia the message buffer (230). Finally, 
the message is returned to the mail server (overwriting a previous importance field, 
if any, or the entire message). In a preferred embodiment, the message is further 
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used, in step 850, for adaptive algorithm traiiiiag, as wiQ be explained in detail 
hereinafler, eventually resulting, in step 860, in the itpdate of classification rules 
maintained in the rules database (430) accordingly, as shown in the drawing. 

It is to be £5>preciated that the method according to the present invention is 
5 being implemented on a copy of the message that is external to a central repository 
^ on which incoming messages are stored so as to enable unintemxpted service in the 
case that said method fails to qperate or malfunctions, and furthermore to avoid loss 
^' of messages or messages parts. 

. In a preferred embodiment, the messages are stored by the e-mail server in 
10 the^ user's inbox concurrently with being processed by the system (10). After an 
importance class of a message has been determined the respective importance 
attribute of the message is updated accordingly. 

Alnessage stored in the message bufifer (230) for classification is processed 
15 first by the feature extmctor (440) and by the text parser and analyzer (460). 

The text parser and tmaiyzer (460) extracts content items from the message 
subject field and from the message bSdy and passes tiiem to the classifier (450) and 
to the user behavior analyzer (420). Content items are a list of key words, terms and 
phrases found in the text The extraction of content items can be performed using 
20 known techniques such -as in Schweighofer and Winiwarter, **Refining the 
selectivity of thesauri by^means of statistical analysis", in Intl. Congress on 
Terminology and Knowledge Engiaeering, 1993. 

The feature extractor (440) extracts the following features from each 
25 • message and passes them to the classifier (450) and to the user behavior analyzer 
(420): 

Sender*s e-mail address and/or nickname 
Recipient's e-mail address and/or niclcaame 
Number of recipients in the '*to*' field 
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NiimbCT of recipients in the "cc*' or *T5cc" field 

The classifier (450) applies rules &om all categories in order to det^mine 
flie message in^rtance level. The rules are drawn firom the rules database (430). 
5 . The foUowing categories of clas^catLoiTrules are applied, as will be explained in 
detail hereinafter: 

(a) organizational 

(b) ^content 

(c) behavioral (adaptive) 

10 

The final importance level is a result of the weighted average of the outputs 
of all rule categories. The weighting factors are a function of predetermined values 
as well as the corresponding confidence level. The classifier contains a collection of 
reference "important* content items and reference *hmimportanf * content items, for 
15 conxparison with content items found in the messages . 

The importance class iadication (the iniportance tag) is attached to each 
message. If the importance class is '"unknown", no tag is attached to that message. 

20 The implementation of the importance tag enables a clear display of the 

iniportance class, by means of different colors or any other clearly visible tag, and 
allows "importance based" sorting in the e-mail client 

Four Importance classes are defined: high, mediimi, low, unknown. 
25 Additional classes can be added upon need; 

The final importance class is determined as a weighted average of the 
outputs of all rules categories. The output of each rule category is assigned a 
confidence level. If none of the conditions of a certain rule is fulfilled then the 
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rule's output is: in5)ortance===uiilaiowEu The final impcataace class is calculated 
according to the foUowiag formula: 

(yW(i)*IC(i) ^ - 
Finaljmportance == INT r^ 

Where: - - ^' 

5 IC(i) — importaace class detennined by rule category i, where IC = 1 for low 

importance, 2 for medium in5)ortance, 3 for high iroportance, 0 for unknown 
importance. ^* 

W(i) — weighting factor of rule category !, where 0<W<1. 

10 The weighting factor for each rule category is calculated as follows: ■ 

W = W_const*CF 
Where: 

15 W_const — the constant weighting factor for that category as conjSgured by 

the admhustrator. ^ 

CF = confidence fector for that category (CF assumes values between zero 
and one, where a zero designates no confidence ^d a one designates fiill 
confidence). • - 

20 ^ 

All weighting coeflBdents, dedsion thresholds and other constants are 
configurable through the administrator GUI (110). A limited number of parameters 
are configurable through the user GUI (120). ^ 

25 The "organizational'* rules category may include rules as in the foUowiag 

Hst: ' 

If <orgamzational distance=low> then < importance=high> 
If <orgamzational distance^TOediun^> then < importance=medh 
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If <c«gamzatioital distance=4ugh?> liiea < importeince=low> 
If <sea<k3: & xedpient belong to workgroup> tib.«G <inxportaiice?=liigh> 
If oddress is exteacaal and address bdoxigs to prefiarred list> th^ 
<ii3ipQrtance^high?^ 

5 

Tbe Qtganizatioiial distance is calculated as foUows: 
Q)cg_dis« depjdis + hear_dis 

Wheire: depjdi$ is the d^artmental distance and berjdis is the hierarcha] 
10 distance, 

Depjdis can assume 3 values - 0,1^ (where 0 CQjoresponds to -fte same 
ctepartment). 

H«.dis can assume the values - 0,1^3^.... (where 0 correspottds to the 
same hicacarchal layej^ 1 corresponds to 4/- 1 level difterence etc.). 

15 

Oiganiz^tional distance is defined as follows: 
If (orgjdis Tl) then (organizational distance^ low) 
If (org^dis > Tl and <= T2) fbm (organizational distance = medium) 
Else (organizational distance hi^) 
20 Default values: Tl=l5 T2=2 

Eadbi user may define a set of personal pref^nces telating to senders 
int^Doal or external to the oxganization. Such user's personal se^ttings can only 
increase fte in^rtance set by the administzaton For exanrple^ personal pie&reQces 
25 may be of tile following ^es: 

(a) If <sender belongs to preiGsrredJbtemaljaul^^ tfien 
<importaxxceF==higl(> 

(b) If <:55ender belongs to prefetred_extemal_address> flien 
<importaiiceHbigfct> 
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The system allows setting importance classes according to a global list of 
preferred originating addresses, extemal to the oTgani2ation, from senders aflBliated 
wifli the organization, such as customers, suppliers, partners, etc. The list of 
5 preferred extemal addresses is defined by the system administratar or is drawn firom - 
the data bases of existinjg enterprise plications such as ©aterprise resource 
plamnng (ERP) or customer resource management (CRM). 

The system allows the setting of inxportance classes according to the SMTP 
10 (Simple Mail Transfer Protocol) message headers such as "cc" and 'Tjcc" 
fields. Following are some exan^les for such rules: 

If <recipieiit alone ia 'to" header> then <importance=high> 
Else If <# of recipients in *to" header less then Nl> then 
<impoftanceF=medium> 
15 If <recipient alone in "cc" header> flien <importance=medium> 

If <recipient alone in **bcc*' header> then <rmportancer=high> 
Hse <importance==unknown> 

The importance class set by the organizational category is according to the - 
20 followrug rule: - ^ 

If at least one rule voted 'Tbigh" then IC=high 
Else if at least one rule voted **medium" Ihen IC>=medium 
Else if at least one rule voted "low" then lOlow --r 
Else IC=uQknown 

25 ' ■ 

The confidence level of the organizational rule category is calculated as,. 

follows: 

If one rule voted for the selected importance class then CF = CF jtnin 
If two rules voted for the selected importance class then CP = CF_med^ 
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If three or more rules voted for the selected inrportance class then CF = 
CFjnax 

Defeult values: CFjiiiii===0.7, CF_inedii2in==0.85, CFjaiax=l 

5 The function of the content based rules is to assist in classifying e-mail 

messages on two levels: 

(a) Higher rating of relevant work-related content 

(b) Lower ratmg of irrelevant content such as jokes, music files^ video- 
ffles, solicitation etc. 

10 There are niimerous prior art methods for classifying text messages 

according to their content (one such method is described in US patent 6,519,580). 
One simple method is a simple '"search and coimf operation of given reference 
keyword. The present invention is not limited to any specific text classification 
-method. ' 

15 

The f<^wing content rules are ^plied: 

If <messagejsubjeGt_contentj[tem belongs to irrelevant_class> then 
<in^ortance^low> (multiple irrelevant classes are supported) ELSE If 
<message__subject_content_item belongs to relevant_class> then 
20 <importance=high> (multiple irrelevant classes are supported). 

If <message_bodyjcontentJtein belongs to irrelevant__class> then 
<importance^low> (multiple irrelevant classes are supported). ELSE If 
<message_body-^contentJ[tem belongs to relevaiit_class> then <importanceF=high> 
25 (multiple krelevant classes are supported). 

Most "reference content-classes" are defined by the system administrator. A 
limited number of "personal reference content classes'* can be defined by each user. 
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provided that they do not conflict with administrator defined classes. For example, 
if a user specifies a key word as "imimportanf' and the same keyword was akea<fy 
defined by the administrator as **importanf the system rejects the personal setting. 

5 Most prior art methods for classifjdng text messages also generate, a 

confidence indication that can be used for calculation of the final importance class. ' 
For a simple key word "search and counf ' method the following sinaple algorithm 
can be used to estimate tiie confidence factor of ftie content rule category: 

If # of matching key words is larger then Njnin but smaller then 
10 N_mediumthenCF = C3''jmin 

If # of matching key words is larger then Njnedium but smaller then 
Njtnax then CF = CF_medium 

If # of matching key words is larger then N_max then CF = CF_max 

15 Defatdt N values: Nmin=2, Njnedium==4, N_max=6 

Default CF values: CF_min=0.7, CF_medium=0.85, CF_max=l 

Adaptive rules are generated based on Hie user's actions taken on previously 
received e-mail messages. The behavioral information is recorded for all e-mail- 

20 .receiving users as defined by the system administrator. The behavioral information 
for each user is recorded on a statistically sufficient number of messages (several 
hundreds). The behavioral iofoxmation is used to deduce Ihe message importance. 
This information is used'' to produce rules that relate various message attributes to 
the importance level of messages. Those rules are used in the process of classifying 

25 new messages. The ^.classification outcome of the adaptive algorithm is 
accompanied by a corresponding confidence factor estimation that is used for the 
calculation of the final importance class as described above. 

The following '^iser actions'* taken on received e-mail messages are 

21 \ ^ . 
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recorded by the **user behavior agenf , causing Tnininial interference with tiie client 
computen 

(a) Hme elapsed from the moment the m^sage was stored in the inbox 
till it was opened by the user 
5 (b) Time elapsed from the moment ns^ started e-mail activity (after the 

message was storedin the inbox) till it was opened 

(c) Time during which the message remained open 

(d) Replying to the message 

(e) Forwarding the message 

10 (f) Filing or saving the message 

(g) Deleting the message * - .? 

(h) Printing the message 

The followiag importance criteria are applied: 
15 - If<timeJ:o_open=shore>then<importance=high> 

If <timeJx)_open_^since_activity_started==shor£> then <importance=higli> 

If <timejtnessage_jopened=long or time_attachmentj>pen=long> then 
<inq)ortance=high> .* 

If <message replied> then <iniportanc^high> 
20 If <message forwarded and j51ed/saved> then <importance=highP> 

If <message printed> then <importance=high> 

If <message forwarded> then <importanceF=medinm> 

If <message j51ed/saved> then <importance=medinn^ 

If <message deleted> then <nnportance=low> 

25 

The final importance class is determined^as a weighted average of the above 
criteria, according to the following formula: 



IC 
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10 



The following attributes are extracted and rated: 

(a) Message sender 

(b) Organizational distance between sender and recipient 

(c) "Subject field content - . - 

(d) Message body content 

(e) Recipient alone in **to" header field 

(f) Recipient alone in "cd- or '*bcc" header field 

(g) Or a combination of two or more of the above 



The adaptive, behavioral based, rules are generated using, for exaii5>le, prior, 
art techniques such as: 

(a) Naive Bayes 

(b) Rule learning or machine learning Algorithms 
15 (c) Support Vector Machine 

Adaptively generated rules have a limited, , configurable, validity time 
(training is performed over the last N days). The algorithm is appKed contoously 
in order to adjust for dynamic conditions. . 

20 The system allows the adoiinistiator to define the orgajiizational structure 

nsing a Graphical User Interface (GUI), or to import it from an existing enterprise 
database. After defining/importing the org chart, Ihe administratoT is able to 
subscribe employees to the invention's services. This is done by clicking the 
"subscribe users" button, and flien selecting firom the org ghart the following 

25 options: * . 

(a) The entire company 

(b) Whole departments 

(c) Whole hierarchallayers 

(d) Individual users 
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For all subscribed employees, the system searches the corporate database 
and retrieve Iheirpeisonal details (e-mail address, nickname). For names not foxmd, 
the system prompts the administrator to manually enter the corresponding data. 

5 The system provides a graphical tool for convenient defmition of fhaorgjlis 

parameter. The tool is jqpplied to the standard^graphical view of the oig chart 

Dep_dis between departments is defibaed hy marking a distance betwe^ two 
departments by clicking them one after another. The GUI prouopts the nser to 
10 choose the distance value, for both directions (message sent from one department to 
the second and for the opposite direction). A definition of distance between two 
departments also apphes to all their sub departments. 

Specifying a distance between sub departments (sub department may consist 
' also an individual in a department or a group of individuals in a department) 
15 ovOTides the distance deJBned between the parent departments. 

A distance is marked between two mdividuals by ijelecting the two. An 
individual distance definition overrides the distance defined between tiie^ 
departments and/or the sub departments (for said individuals only). 

20 The system allows Ihe selection of a group, of individuals that belopg to a 

workgroup (with or without a time limit). For all individuals who belong to the 
workgroup - org_dis = low. The system alerts the administrator T before expiration 
of the workgroup vahdity period (T= 1 week). For woikgroup definition the system 
treats department managers as individuals and not as representatives of their 

25 departments. • - 

TV 

The add/remove users screen allows the administrator to add or remove 
employees' subscription to the invention's services after the initial system 
installation. After selecting the add/remove button the administrator is presented 
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with the org chart screm, where he is able to add/remove/change the aiser in the oig 
chart, and then subscribe/unsubscribe him in the procedure described above. 

Data on the perfomaance of the invention is logged in the database for 
5 history recording, oflQine performance analysis, performance improvements and - . 
user behavior profiling. The system allows for easy plication of various statistical 
analysis operators (average, standard deviation, histograms, correlation etc.) and 
graphical presentation of the results. A partial list of parameters for logging may 
include: * * * r 

10 (a) Distribution of messages according to their hnportance level (per 

corporate, department or individual user). 

(b) Correlation between rules categories and specific rules results, and 
the fibaal importance setting. 

(c) Correlation between user behavior and the final importance setting 
15 (per corporate, department or individual user). 

(d) Monitoring of changes noiade by users to their personal preferences. 

(e) The behavior ofthe above data over time (daily, weekly, monthly and ^ 

yearly resolution). 

It will also be understood that the systOTi according to the mvention may be 
20 a suitably programmed computer. Likewise, the invention contemplates a computer 
program being readable by a con:5)uter for executing the method ofthe invention. 
The invention further contemplates a machine-readable memory tangibly ^ 
embodying a program of instructions executable by the machine for executing the ^ 
method of the invention. # 
25 Eti the method claims that follow, alphabetic characters and Roman numerals 

used to designate claim steps are provided for convenience only and do not imply 
any particular order of perfornaing the steps. 
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